Quick Pad Tagger: an Efficient Graphical User Interface for Building Annotated Corpora with Multiple Annotation Layers
نویسندگان
چکیده
More and more domain specific applications in the internet make use of Natural Language Processing (NLP) tools (e. g. Information Extraction systems). The output quality of these applications relies on the output quality of the used NLP tools. Often, the quality can be increased by annotating a domain specific corpus. However, annotating a corpus is a time consuming and exhaustive task. To reduce the annotation time we present a custom Graphical User Interface for different annotation layers.
منابع مشابه
Ontology-Based XQuery'ing of XML-Encoded Language Resources on Multiple Annotation Layers
We present an approach for querying collections of heterogeneous linguistic corpora that are annotated on multiple layers using arbitrary XML-based markup languages. An OWL ontology provides a homogenising view on the conceptually different markup languages so that a common querying framework can be established using the method of ontology-based query expansion. In addition, we present a highly...
متن کاملPubAnnotation-query: a search tool for corpora with multi-layers of annotation
PubAnnotation provides a convenient platform to collect and align corpora with various annotations. However, corpora must be searchable to be useful, but there has been no standard way to search corpora, particularly when multiple layers of annotations are present. PubAnnotation-query is designed to provide an interface for searching corpora annotated with multiple layers. This paper describes ...
متن کاملInteractive Corpus Annotation
We present an easy-to-use graphical tool for syntactic corpus annotation. This tool, Annotate, interacts with a part-of-speech tagger and a parser running in the background. The parser incrementally suggests single phrases bottom-up based on cascaded Markov models. A human annotator confirms or rejects the parser’s suggestions. This semi-automatic process facilitates a very rapid and efficient ...
متن کاملN.b.: A graphical user interface for annotating spoken dialogue
Corpora of transcribed and annotated dialogues are very useful for developing and evaluating the coverage of algorithms for discourse generation and interpretation and dialogue modelling. On the other hand, there is no agreement on the choice of units and conventions for annotating discourse constituents, and the annotation process can be difficult and prone to inconsistencies. This paper prese...
متن کاملFuzzy Neighbor Voting for Automatic Image Annotation
With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015